Does a Plane Imitate a Bird? Does Computer Vision Have to Follow Biological Paradigms?
نویسنده
چکیده
We posit a new paradigm for image information processing. For the last 25 years, this task was usually approached in the frame of Triesman’s twostage paradigm [1]. The latter supposes an unsupervised, bottom-up directed process of preliminary information pieces gathering at the lower processing stages and a supervised, top-down directed process of information pieces binding and grouping at the higher stages. It is acknowledged that these subprocesses interact and intervene between them in a tricky and a complicated manner. Notwithstanding the prevalence of this paradigm in biological and computer vision, we nevertheless propose to replace it with a new one, which we would like to designate as a two-part paradigm. In it, information contained in an image is initially extracted in an independent top-down manner by one part of the system, and then it is examined and interpreted by another, separate system part. We argue that the new paradigm seems to be more plausible than its forerunner. We provide evidence from human attention vision studies and insights of Kolmogorov’s complexity theory to support these arguments. We also provide some reasons in favor of separate image interpretation issues.
منابع مشابه
Robot Motion Vision Part II: Implementation
The idea of Fixation introduced a direct method for general recovery of shape and motion from images without using either feature correspondence or optical flow [1,2]. There are some parameters which have important effects on the performance of fixation method. However, the theory of fixation does not say anything about the autonomous and correct choice of those parameters. This paper presents ...
متن کاملRobot Motion Vision Pait I: Theory
A direct method called fixation is introduced for solving the general motion vision problem, arbitrary motion relative to an arbitrary environment. This method results in a linear constraint equation which explicitly expresses the rotational velocity in terms of the translational velocity. The combination of this constraint equation with the Brightness-Change Constraint Equation solves the gene...
متن کاملOnline multiple people tracking-by-detection in crowded scenes
Multiple people detection and tracking is a challenging task in real-world crowded scenes. In this paper, we have presented an online multiple people tracking-by-detection approach with a single camera. We have detected objects with deformable part models and a visual background extractor. In the tracking phase we have used a combination of support vector machine (SVM) person-specific classifie...
متن کاملCultural Landscape of Nayband Village
Nayband is a historical village in South Khorasan, which is located on the western edge of the desert. The adjacency of the village to the caravan route, its strategic position, and the existence of water resources and fertile lands have led to the formation of its texture. The harsh climatic conditions of the region and the existence of miscreants and bandits in the historical periods, while c...
متن کاملConstruction Safety Visualization
Throughout the history of the construction industry, many fatalities and injuries have occurred in construction sites. One of the major causes of accidents is unsafe site conditions, which basically is due to inadequate supervision. To improve upon the traditional supervision approach, this study proposes a construction safety visualization approach. In this research paper, we provide a compute...
متن کامل